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Abstract — We study error-correcting codes for permutations 
under tiie infinity norm, motivated by a novel storage sclieme 
for flasli memories called rank modulation. In this scheme, a set 
of n flash cells are combined to create a single virtual multi- 
level cell. Information is stored in the permutation induced by 
the cell charge levels. Spike errors, which are characterized by 
a limited-magnitude change in cell charge levels, correspond to 
a low-distance change under the infinity norm. 

We define codes protecting against spike errors, called limited- 
magnitude rank-modulation codes (LMRM codes), and present 
several constructions for these codes, some resulting in optimal 
codes. These codes admit simple recursive, and sometimes direct, 
encoding and decoding procedures. 

We also provide lower and upper bounds on the maximal 
size of LMRM codes both in the general case, and in the case 
where the codes form a subgroup of the symmetric group. In 
the asymptotic analysis, the codes we construct out-perform the 
Gilbert- Varshamov-like bound estimate. 

Index Terms — flash memory, rank modulation, asymmetric 
channel, permutation arrays, subgroup codes, infinity norm 

I. Introduction 

IN the race to dominate non-volatile information-storage de- 
vices, flash memory is a prominent contender. Flash mem- 
ory is an electronic non-volatile memory that uses floating-gate 
cells to store information [7]. While initially, flash memory 
cells used to contains a single bit of information, in the 
standard multi-level flash-cell technology of today, every cell 
has q > 2 discrete states, {0,1, . . . ,q — 1}, and therefore can 
store log2 q bits. The flash memory changes the state of a cell 
by injecting (cell programming) or removing (cell erasing) 
charge into/from the cell. 

Flash memories possess an inherent asymmetry: writing is 
more time- and energy-consuming than reading [7]. The main 
reason behind this asymmetry is the iterative cell-programming 
procedure designed to avoid over-programming [2] (raising 
the cell's charge level above its target level). While cells 
can be programmed individually, only whole blocks (today, 
containing approximately 10^ cells, see [7]) can be erased 
to the lowest state and then re-programmed. Since over- 
programming can only be corrected by the block erasure, in 
practice a conservative procedure is used for programming a 
cell, where charge is injected into the cell over quite a few 
rounds [2]. After every round, the charge level of the cell 
is measured and the next-round injection is configured. The 
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charge level of the cell is made to gradually approach the 
target state until it achieves the desired accuracy. The iterative- 
programming approach is costly in time and energy. 

Another major concern for flash memory is data reliability. 
The stored data can be corrupted due to charge leakage, a 
long-term factor that causes the data retention problem. The 
data can also be affected by other mechanisms, including 
read disturbance, write disturbance [7], etc. Many of the error 
mechanisms have an asymmetric property: they make the cells' 
charge levels drift in one direction. (For example, charge 
leakage makes the cell levels drift down.) Such a drift of cell 
charge levels causes errors in aging devices. The problem of 
data corruption is further aggravated as the number of levels 
in multi-level cells increases, since this reduces the safety 
margins for correct reading and writing. 

To address these issues, the rank-modulation scheme has 
been recently suggested [17]. By removing the need to mea- 
sure absolute cell-charge levels, the new scheme eliminates 
the risk of cell over-programming, and reduces the effect 
of asymmetric errors. In this scheme, a virtual cell that is 
composed of n cells with distinct charge levels, induces a 
permutation which is used to represent the stored information. 
Each cell has a rank which indicates its relative position when 
ordering the cells according to descending charge-level. The 
ranks of the n cells induce a permutation of {1,2, ... ,n}. 

When writing or reading the cell charge levels, we only need 
to compare the charge levels between cells. Thus, the rank- 
modulation scheme eliminates the need to use the absolute 
values of cell levels to store information. Since there is 
no risk of over-programming and the cell charge levels can 
take continuous values, a substantially less conservative cell 
programming method can be used and the writing speed can be 
improved. In addition, asymmetric errors become less serious, 
because when cell levels drift in the same direction, their ranks 
are not affected as much as their absolute values. This way 
both the writing speed and the data reliability can be improved. 

While the rank-modulation scheme alleviates some of the 
problems associated with current flash technology, the flash- 
memory channel remains noisy and an error-control mecha- 
nism is required. In this work we consider an error model 
which corresponds to spike errors. Such errors are charac- 
terized by a limited-magnitude change in the charge level of 
cells, and readily translates into a limited-magnitude change 
in the rank of, possibly, all cells in the stored permutation. 
This corresponds to a bounded-distance change under the Geo- 
metric. We call codes protecting against such errors limited- 
magnitude rank-modulation codes, or LMRM-codes. 

A similar error model for flash memory was considered not 
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in the context of rank modulation in [8], while a different 
error-model (charge-constrained errors for rank modulation) 
was studied in [18]. Codes over permutations are also referred 
to as permutation arrays and have been studied in the past 
under different metrics [3], [4], [9], [10], [14], [15], [28]. 
Specifically, permutation arrays under the £co-metric were 
considered in [24]. 

The main contribution of this paper is a set of constructions 
and bounds for such codes. The constructions presented are 
applicable for a wide range of parameters, and admit simple 
decoding and encoding procedures. We also present bounds 
on code parameters both for the general case, as well as for 
the more restricted case of subgroup codes. Most notably, we 
present an asymptotically-good family of codes, with non- 
vanishing normalized distance and rate, which exceed the 
Gilbert- Varshamov-like lower bound estimate. 

It is important to note that, independently and concurrently, 
Kl0ve, Lin, Tsai, and Tzeng [20] describe Construction [T] and 
its immediate generalization. Construction |2] As the overall 
overlap is small, and since the two constructions lead to our 
Construction [3] which we show to produce an optimal code, 
we bring these first two here for the sake of completeness. 

The rest of the paper is organized as follows. In Section 
im we define the notation, and introduce the error-model as 
well as the associated foo-metric. We proceed in Section |III] 
and present the code constructions and encoding/decoding 
algorithms. In Section |IV] we investigate general bounds on 
LMRM codes, code-anticode bounds, and asymptotic-form 
bounds. We conclude in Section |V] with a summary of the 
results and a short concluding remarks. 

II. Definitions and Notations 

For any ffi, n G N, m ^ n, let [ni, n] denote the set 
{m, m + 1,. . ., n}, where we also denote by [n] the set [1, n]. 
Given any set A of cardinality n, we denote by the set of 
all permutations over the set A. By convention, we use S„ to 
denote the set S[„]. 

We will use both the vector notation for permutations 
f E S„, where / = \Ji, fi, ■ ■ ■ , fn\ denotes the permutation 
mapping i i— > /; = f{i) for all i G [n], and the cycle notation, 
where / = {fi, fi, ■ ■ ■ , ft) denotes the permutation mapping 
// ^ fi+l for iE[k — 1] as well as //^ /i. Given 
two permutations f,gE Sn, the product fg is a permutation 
mapping i i— > f{g{i)) for all i e [n]. 

Let us consider n flash memory cells which we name 
l,2,...,n. The charge level of each cell is denoted by C; G ]R 
for all i G [n]. In the rank-modulation scheme defined in [17], 
the charge levels of the cells induce a permutation in the 
following way: The induced permutation (in vector notation) 
is [fiji, ■ ••,/«] iff Cf^>Cf^> ■■■> Cf,^. 

The rank-modulation scheme is defined by two functions: 
an encoding function E : Q — > Sn, which takes a symbol 
from the input alphabet aG Q and maps it to a permutation 
/ = E(fl) G S„, and a decoding function D : S„ ^ Q. Since 
no channel is devoid of noise, a stored permutation / = E{a) 
may be corrupted by any of a variety of possible disturbance 
found in flash memory (see [7]). Assuming the changed 



version of /, denoted /', is not too corrupted, we would 
like the decoding function to restore the original information 
symbol, i.e., D(/') = a. 

For a measure of the corruption of a stored permutation 
we may use any of a variety of metrics over S,, (see [12]). 
Given a metric over Sn, defined by a distance function d : 
S„ X S„ ^ N U {0}, an error-correcting code is a subset of 
S,i with lower-bounded distance between distinct members. 

In [18], the Kendall-T metric was used, where the distance 
between two permutations is the number of adjacent transpo- 
sitions required to transform one into the other This metric 
corresponds to a situation in which we can bound the total 
difference in charge levels, and the error-correcting codes are 
therefore named charge-constrained rank-modulation codes. 

In this work we consider a different type of common error - 
a hmited-magnitude spike error. Suppose a permutation / G Sn 
was stored by setting the charge levels of n flash memory 
cells to Ci,C2, . . .,c„. We say a single spike error of limited- 
magnitude L has occurred in the f-th cell if the corrupted 
charge level, c^, obeys | c,- — | ^ L. In general, we say spike 
errors of limited-magnitude L have occurred if the corrupted 
charge levels of all the cells, c'^,C2, . ■ . ,c'^, obey 



max Cj 

i e \n] 



^ L. 



Let us denote by /' the permutation induced by the cell 
charge levels Cj, Cj, . . . , c'^ under the rank-modulation scheme. 
Under the plausible assumption that distinct charge levels 
are not arbitrarily close (due to resolution constraints and 
quantization at the reading mechanism), i.e., |c/ — | ^ £ 
for some positive constant £ G IR for all i ^ j, a spike error 
of limited-magnitude L implies a constant d G N such that 



max 

ie[n] 



< d. 



Loosely speaking, an error of limited magnitude cannot change 
the rank of the cell i (which is simply f^^{i)) by d or more 
positions. 

We therefore find it suitable to use the ^oo-metric over S„ 
defined by the distance function 



doo{f,g) 



I e n] 



for all f,ge Sn- Since this will be the distance measure used 
throughout the paper, we will usually omit the oo subscript. 

Definition 1. A limited-magnitude rank-modulation code 
(LMRM-code) with parameters {n,M,d), is a subset C C Sn 
of cardinality M, such that doa (/, g) ^ d for all f,gEC, 
f ^ g- (We will sometimes omit the parameter M.) 

We note that unlike the charge-constrained rank-modulation 
codes of [18], in which the codeword is stored in the per- 
mutation induced by the charge levels of the cells, here the 
codeword is stored in the inverse of the permutation. 

It may be the case that the code C forms a subgroup of 
the symmetric group S„, which we will denote by C ^ S„. 
We shall call such a code a subgroup code. Since groups offer 
a rich structure, we will occasionally constrain ourselves to 
discuss subgroup codes. 
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III. Code Constructions 

In this section we describe three constructions for LMRM 
subgroup codes. The first two were discovered independently 
and concurrently by [20]. We begin our constructions with 
the following, which bears a resemblance to the unidirectional 
limited-magnitude codes described in [1]. This construction 
will turn out to be a simple case of a more general construction 
given later. 

Construction 1. Given n, d G IN we construct 

C={feSn\fii) = i (modd)}. 
Alternatively, for every i G [d\ let 

Ai = {dZ + n [n] = {; e [n] | ; = f (mod d)} , 

and define C to be the direct product of the symmetric groups 
over the A: 's, 



C = Saj X 



Theorem 2. The code C from Construction\l\is an {n,M,d)- 
LMRM code with 

M = ( \n/d] !)" ( [n/d\ ''^ . 

Proof: The length and size of the code are easily seen 
to be as claimed. All we have to do now is show that the 
minimal distance of the code is indeed d. Let f,g^C he two 
distinct codewords, and let i G [n] be such that f{i) 7^ ^(0- 
Since f{i) = (mod d) it follows that |/(f) - g{i) \ ^ d, 
and so d{f,g) ^ d. ■ 
This construction allows a simple encoding procedure. To 
simplify the presentation let us assume that d divides n. The 
encoder takes as input an integer m G [0, M — 1] (where M 
is the size of the code), e.g., by translating from a string of 
[log2 MJ binary input symbols. The number M can then be 
written in base (n/d)\, that is 

d-l 

M= J^niiiin/dy.y, 

where ^ ffZ; ^ {n/d)\ — 1. Finally, for every i we map the 
z-th digit, )«,, to S/\.^^ using some function 

J^i:{0,l,...,{n/d)\-l}^SA,^,. 

There are numerous efficiently-computable functions to satisfy 
jTj, such as the factoradic representation (see [22], [23], [26]), 
as well others (see [21] and references therein). Then, by using 
{!Fo,!Fi, ...,!Fil_l} the resulting encoding becomes 

m Toimo) X j^i(mi) X ■ ■ ■ X T^-iimd-i)- 

A straightforward decoding procedure is also obtainable. Let 
us assume that f&C was stored, where C is an {n,M,d)- 
LMRM code from Construction [T] while the retrieved per- 
mutation was /' G S;,. We further assume that the maximum 
magnitude of errors introduced by the channel is [{d — 1) /2J , 
i.e., 1/(0-/(01 ^ lid-l)/2\ for all ie[n]. 

Since C is a code of minimum distance d, there is a 
unique codeword /* at distance at most [{d — 1)/2J from 



/'. Recovering this codeword is simple and may be done in- 
dependently for each of the coordinates: For every coordinate 
/ G [n], there is a unique f* G [n] such that \ f* - f'{i) \ ^ ^ 
and f* = i (mod d). The recovered permutation /* G S„ 
is given by /*(;') = /*. By definition, /* G C, and by the 
algorithm presented we also have d (/*,/') ^ [(d — 1)/2J, 
hence f* = f which is the original permutation which was 
stored. 

Finding the original input message may be accomplished by 
decomposing / G C into a product of permutations from S^i 
and applying JTr^ appropriately. 

We now extend the direct-product approach and generalize 
the previous construction. First we introduce a new notation. 
Given f E S,,, and a set A C N of size n, we denote by /a 
the same permutation but over A. More formally, assuming 
A = {flj, fl2/ • ■ • /flfi}, with fli < fl2 < ■ ■ ■ < we set 

/a = [«/(l)'«/{2)' ■•■'«/(«)]• 

Furthermore, given a set C C S«, we define 

Ca^Ua l/eC}. 

Construction 2. Let n,kEN, and define the sets 

Ai = {kZ + i) n [n], 

for all i G [k] . Furthermore, for all i G [k] let O be an 
{yii, Mi, di)-LMRM code, with rii = |A/|. We construct the 
code C C Sn, 



C — ^Ai X ^A2 X 



Theorems. The code C from Construction^ is an {n,M,d)- 
LMRM with M = nf=i M/, and d = min, ^ [^j fcd/. (By 
convention, the distance of a code with one codeword is defined 
as infinity.) 

Proof: Again, the length and size of the code are easily 
verified. In addition, given /, ^ G C, / 7^ ^, it is easy to see 
that f{i) — g{i) is a multiple of k, for any i G [n], and so the 
distance of each of the constituent codes is scaled by k, giving 
the desired result. ■ 
Before describing the next construction we briefly observe 
some properties which may be thought of as analogues to 
the case of linear subspace codes. The metric defined by dco 
over Sn is a right invariant metric (see [12]), i.e., for any 
f,g,heS„, 

doo{f,g) =doo{fh,gh). 

We can then define the weight of a permutation f E Sn as 

wt(/) = dco (/,/), 

where i denotes the identity permutation. Thus, for any C ^ 
Sn, an (m, d)-LMRM subgroup code, it follows that 

d = min wt( f). 

For convenience, given a set H C S«, we denote 

d(H) = min d{f,g) 

f,geH,f^g 

d{H) = max d{f,g) 
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Finally, we recall the following notation: For H,K C S„ we 
denote 



H 



K 



h" = khk- 



heH,keK 



}■ 



Constructions. Let H and K be subgroups of Sn such that 
= H and HHK = {/}. We construct the code C from the 
following semi-direct group product, 

C = H >4K'^ HK= {hk\heH,keK} . 

Theorem 4. The code from Construction^ is an {n,M,d)- 
LMRM subgroup code with M = \H\\K\ and 

d ^ max\d{H) -d{K),d{K)-d{H)^ . 

Proof: It is well known (see for example [16]) that if 
H'^ = H and Hn X = {/} then HK = KH Sn and 
\HK\ = \H\ \K\. Given he H and keK, where hk ^ i, then 
from the triangle inequality 

d{L,hk) ^ d[L,k) - d[hk,k) = wt(fc) - wt(/z) 
^ d{K)-d{H). 

Interchanging h and k gives the other lower bound. ■ 
The lower bound on the distance given in TheoremlH which 
we shall call the design distance, is often not tight as is shown 
in the following example. 

Examples. Let us construct an LMRM code of length n = 
6 and distance d = 3. According to construction\l\ the code 
S2 X S2 X S2 is a {6,8, 3)-LMRM code. 

We can improve this by looking at the code C3 ^ S3 defined 

by 

C3 = {[1,2,3], [2,3,1], [3, 1,2]}, 

i.e., the cyclic group of size 3, which is a (3, 3, 2) -LMRM code. 
By Construction^ the code C3 x C3 is a (6, 9, 4) -LMRM code, 
providing us a larger code than the previous one, with a larger 
distance. 

Finally, let us define K ^ Sg, a {6, 2, 5) -LMRM code, as 

K={[1, 2, 3, 4, 5, 6], [6, 5, 4, 3, 2, 1] } . 

It may be verified that H = C3 x C3 and K can be used with 
Construction^ resulting in a (6, 18, 3) -LMi?M code. We note 
that while the design distance guaranteed by Theorem^ is just 
1, the resulting distance of the code is actually 3. □ 



IV. Bounds 



A. General Bounds 



The first two bounds we present are the obvious analogues 
of the Gilbert- Varshamov bound, and the ball-packing bound 
(see, for example, [25]). We first define the ball of radius r 
and centered about / G S„ as the set, 

BrAf) = {g^S„ I d{f,g)i:r}. 

As mentioned before, the £co metric over Sn is right invariant, 
and so the size of a ball depends only on r and n, and not 
on the choice of center We will therefore denote by | Br,n \ the 
size of a ball of radius r in S„. 



Theorem 6. Let n, M, and d, be positive integers such that 
\Bd-i,n\ M ^ nl. Then there exists an {n, M, d) -LMRM code. 

Proof: Consider the following procedure: We start with 
the entire set Sn with all permutations unmarked, as well 
as an empty code C. At each step we choose an unmarked 
permutation, /, add it to C, and mark the ball B^„i„(/). 
We stop when there remain no unmarked permutations. The 
resulting code has minimal distance at least d, and the number 
of iterations (which equals the size of the code) is at least 

Next is a ball-packing bound, which was already mentioned 
in [24], and which we bring for completeness. 

Theorem l.LetC be an [n, M, d) -LMRM code. Then 



[{d-l)/l\,n 



M ^ nl. 



Proof: Since £co over S,, is a metric, and by the definition 
of an (n,M,d) -LMRM code, the balls of radius [(d-l)/2j 
centered about the codewords of C are disjoint, proving the 
claim. ■ 

We now proceed to present two upper bounds which are 
stronger, in general, than the ball-packing bound of Theorem|7] 
The first pertains to subgroup codes, while the second is more 
general. Before starting, we recall some well-known results 
from group theory (see [16]). 

Let G be a subgroup of S„. For any / E [n], the orbit of i 
under the action of G is defined as the set 

The stabilizer of i under the action of G is defined as 

G, = {gEG\g{i) = i}, 
and is a subgroup of G. Furthermore, 

|G| = [f'^l ■ |G/|. (1) 
Theorem 8. IfC is an [n, M, d) -LMRM subgroup code, then 

M< 

~" (d!)L"/'^J(n mod d)\' 

Proof: For convenience, let us denote r = n mod d, and 
fc = [n/d\. Let us now consider C as it acts on the d-subsets 
of [n]. By ([T]l we get 



M 



ICj 



[l,d] 



c. 



l,d] 



c. 



[14] 



where the last inequality follows from the fact that the orbit 
of [1, d] under C contains at most all the d-subsets of [n] . We 
can take another similar step and get 



M ^ 



C[i,d] 

n — d 
d 



[d + l,2d]Mwi 



C, 



l,d],[d+l,2d] 



c. 



[l,d],[d+\,ld] 



where [d + l,2rf]''l^'''l denotes the orbit of [d + l,2d\ under 
the action of C\^\dY i-^-' the stabilizer of [1, d] under C, while 
C[2 ^1 2d] denotes the subgroup of C stabilizing both [l,d] 
and [d + {,2d\. 
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Reiterating the argument above we reach 



k-l 
i=0 



n — di 



^[l,d][d+l,2d],...,[{k-l)d+l,kd] 



It is now easy to see that 

'^[l,d],[d+l,2d],...,[(k-l)d+l,kd] ^ ' 
or else the minimum distance d of C would be violated. Thus, 



k~l 
i=0 



n — di 
d 



{d\fr\' 



We can strengthen the upper bound of Theorem [8] by 
showing that codes attaining it with equality must also satisfy 
certain divisibility conditions. 

A group G ^ Sn is said to be transitive if for any i, j E [n] 
there is a permutation f E G such that /(/) = By ([T]i, the 
size of such a group G must be divisible by n, since the orbit 
of / under the action of G is [n]. 

Extending this definition, we say a group G ^ S„ is k- 
homogeneous if for any two A:-sets A, B C [n] , there exists 
a permutation f EG such that /(A) = B, where f{A) = 
{f{a) I a E A}. It then follows from ([U, that the size of such 
a group G must be divisible by (^). 

The following theorem was given in [6]: 

Theorem 9. Let G ^ Sn be a k-homogeneous finite group, 
where 2fc ^ « + 1. Then G is also {k — 1) -homogeneous. 

Hence, for a fc-homogeneous group G ^ S„, 2k ^ n + 1, the 
size of the group G is divisible by 



K 



n,k 



Icm 



n 

k-l 



Theorem 10. Let C ^ S„ be an {n,M,d)-LMRM subgroup 
code attaining the upper bound of Theorem^with equality, i.e.. 



M 



Then 



1cm <^ K 



(d!)L"/'*J(«mod dy.' 

n-2d + l 



^n—id,d 



^ f ^ 



M. 



Proof: If we examine the proof of Theorem [H] for C to 
attain the upper bound we must have | [1, d]'- 1 = Thus, for 
any d-subset AC [n] , there exists a permutation f/^EC such 
that ( [1, d] ) = A. It now follows, that for any two d-subsets 
A,B C [n], we have that /b/^^A) = B, and /b/^^eC 
since C forms a subgroup. Hence, C is d-homogeneous. If 
2d ^ n + 1 then by Theorem |9] we have J<C„^ | M. 

Continuing in the same manner, the group C[i^] may be 
viewed as a permutation group over [n — d] by deleting the 
elements of [d] and relabeling the rest. Again, we must have 



[d- 



l,2d]'^M 



which means that C[i ^j] is also d- 



homogeneous. Again, if 2d ^ n — d + 1 then divides 



C, 



l,d] 



but \Ci^ij\ divides |C| since Cji^j] ^ C. Reiterating 



the above arguments proves the claim. 



It is also important to notice that if an (n, M,d)-LMRM 
subgroup code C exists, then M | n! since C ^ S„. 

Example 11. Continuing Example^ we would like to find an 
upper bound to LMRM subgroup codes of length n = 6 and 
minimum distance d = 3. 

We first substitute n and d in the ball-packing bound of 
Theorem |7] We get an upper bound (not only for subgroup 
codes) of [6!/13J = 55 since the size of a ball of radius 1 
in Sg equals 13. 

Setting n = 6 and d = 3 in Theorem [8] we get an upper 
bound of size 6!/(3!)2 = 20. If a {6,20, 3)-LMRM subgroup 
code exists, then by Theorem\lU\its size must be divisible by its 
length (since it must be 1 -homogeneous). However, 6 does not 
divide 20, and the next candidate for an upper bound, 19, does 
not divide 6! = 720. Thus, the resulting upper bound is 18. 
This makes the (6, 18, 3) -LMRM subgroup code from Example 
\5\optimal. □ 

B. Codes and Anticodes 

We turn to describe another powerful bounding technique. 
The resulting bounds bear a striking resemblance to the code- 
anticode method of Delsarte [11] and the set-antiset method of 
Deza [13]. However, both methods are not directly applicable 
to the case at hand. 

Given a metric space with integer distances, we can con- 
struct a graph whose vertices are the points in the space, 
and an edge connects two vertices if and only if they are at 
distance 1 from each other We call this the induced graph of 
the metric. If the metric distance between any two points in 
the space equals the length of the shortest path between the 
corresponding vertices in the induced graph (i.e., the distance 
in the graph), we say the metric space is graphic. 

The code-anticode method of Delsarte requires a graphic 
metric space which forms a distance-regular graph. In our 
case, the ^co-metric over S„ is not even graphic, and hence the 
code-anticode method does not apply. The set-antiset method 
requires a metric over S„ which is both right and left invariant. 
Again, the £oo metric-fails to meet the method's requirements 
since it is not left invariant. 

Given a set A C S,,, we denote 

V{A) = {d{f,g)\f,gEA}. 
We also denote the inverse of A as 

A-i = {/-i I/ga}. 

Definition 12. Two sets. A, B C Sn are said to be a set and an 
antiset if 

V{A)nV{B) = {0}. 

The following is the set-antiset bound for right-invariant 
metrics over S„. 

Theorem 13. Let d : Sn x S„ N U {0} be a distance 
measure inducing a right-invariant metric. Let A, B C S„ be 
a set and an antiset. Then 

lAI ■ IBI \S„\ = n\. 
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Proof: It is obvious that 

We contend that \A-'^B\ = \A-'^ \ ■ \B\ = \A\ ■ \B\. Let 
us assume the contrary, i.e., that there exist /i,/2GA and 
gl,g2&B such that /f^gi = f2^g2 but not both /i = /2 
and gi = gi- 

In that case, it follows that gig2^^ = flfi^- have, 
d{gi,g2) = d{gig2-\i) = d{f,f^\i) = d(/i,/2). 
But then 

d{gi,g2) = d(/i,/2) e 2?(A) n I?(B) = {0} 

implying that gi — g2 and fi = /2, a contradiction. ■ 
To apply the set-antiset method to LMRM codes we need 
the following definition. 

Definition 14. A limited-magnitude rank-modulation anticode 
(LMRM-anticode) with parameters [n, M, d), is a subset A C 
Sn of cardinality M, sucli that rfoo (/, g) ^ d for all f,g&A. 

Theorem 15. Let C be an {n, Mq, d)-LMRM code, and let A 
bean {n,MA,d- 1) -LMRM anticode. Then Ma Mq ^ n\. 

Proof: By the definition of a code and an anticode it is 
easily seen that 'D{A) H T>{C) = {0}. The claim is then a 
direct consequence of Theorem [13] ■ 
Theorem [15] generalizes previous results. It may be easily 
verified that a ball of radius [{d — 1)/2J centered about the 
identity permutation / is an {n,d — 1)-LMRM anticode. Thus, 
the ball-packing bound of Theorem [7] is a special case of 
Theorem [15] 

The following is a generalization of Theorem [8] to LMRM 
codes which are not necessarily subgroups. 

Theorem 16. If C is an{n,M,d) -LMRM code, then 

nl 



M ^ 



(d!) mod d)[' 



Proof: We construct the following {n,M',d- 1)-LMRM 
anticode A: Let us denote 

Ai = {[l,d] + {i-l)d)n[n]. 

We now define the anticode A as 



A = Sa^x Sa^x ■■■ X Sa. 



n/dy 



It is easy to verify that A is indeed an anticode of maximum 
distance d — 1, and that its size is 

M' = (d!)L"/''J(nmodd)!. 

By Theorem [TSl M ■ M' ^ n\, and the claim on the maximal 
size of an LMRM code follows. ■ 
It should be noted that Theorem [16] does not make Theorem 
[8]redundant, since through the proof of the latter we were able 
to provide stricter necessary conditions for potential subgroup 
codes attaining the bound with equality, as seen in Theorem 

m 

The next obvious question is: What is the size of the 
maximal size of an {n,d — 1)-LMRM anticode? 



Theorem 17. Let A be an {n,M,d - 1)-LMRM anticode. 
ThenMi^ (d!)"/'^. 

Proof: For all 1 ^ z ^ « let = {/(;) | / e A}. It is 
easy to see that | z'"^ | ^ d, otherwise there would exist f,g & A 
such that |/(z) — g{i)\ ^ d which contradicts that maximal 
distance of A. 

Let P be the following n x n binary matrix, where Pj j = 1 
iff there exists f E A such that /(z) = j, otherwise P, y = 0. 
It is well known (see for example [27]) that 

\A\^peT{P)= ^ nP,^(,) 

/6S„1=1 

since all summands are either or 1, and every permutation 
in A corresponds to a non-vanishing summand. 

According to Bregman's Theorem (see [5]), for any n x n 
binary matrix P with r, I's in the z'-th row 

per(P)j^n(r,!)i 

1=1 

In our case, every row of P contains at most d I's. We can 
certainly change some O's into I's in P so that every row 
contains exactly d I's, and by doing so, only increase the 
value of per(P). It now follows that 

M = |A| per(P) ^ (rf!)"/'^. 

■ 

Thus, for the case of d\n we have an optimal anticode: 

Corollary 18. The anticode constructed as part of Theorem\W\ 
is optimal when d\n. 

When d does not divide n the anticodes constructed in 
the proof of Theorem [16] are not necessarily optimal. The 
following theorem shows we can build larger anticodes. 

Theorem 19. Let us denote r = n mod d. Then there exists an 
{n, M' , d - 1)-LMRM anticode of size 



M' 





[i\-\d-r)\ 


d+r 
2 


! 


d+r~\ I 
2 • 


( 




d+r 
2 







Proof: Consider the following (d + r) x (d + r) binary 
matrix P: 



P = 



ir(rf+i-)/2ixd 


0\{d+r)/2]xr 


0[(d+i-)/2Jxr 


l[(<i+r)/2Jxd 



where l,x; (respectively, 0/x/) denotes the all I's (respectively, 
all O's) matrix of size i x j. It may now be verified that 



r-l 

per(P) = {d-r)\Yl 

1=0 



d + r 
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We now construct the following n x n binary matrix Q: 



Q 



\ 











V 









P 



J 



where along the diagonal we have [^J — 1 blocks of Idxd- 

All the rows contain a contiguous block of I's of size d, 
and thus, all the permutations contributing to per(Q) form an 
anticode of maximum distance d — 1. It can be easily seen 
that 

M' = per(Q) = (d!) LttJ per(P), 

as claimed. ■ 
With these anticodes we get the following two theorems. 

Theorem 20. If C is an{n,M,d) -LMRM code, then 



M ^ 



d+r 
2 



d+r 
2 


1-) 


! 


d+r 
2 


! 


d+r 
2 


I 



(rf!)LiJ-i(d-r)! 
where r = n mod d. 



Proof: Simply use the size of the anticodes of Theorem 
[191 with Theorem [T5] ■ 

Theorem 21. The optimal {n,n - 1) -LMRM code, n ^ 3, has 
size 3. 

Proof: By Theorem |20] we have the following upper 
bound on the size of {n,n — 1)-LMRM codes: 



n-l 



n even 



(«-2)!L5jr5l nodd 

and since the size must be an integer, it cannot exceed 3. Such 
a code can be easily constructed for any n ^ 3 and is simply 
the cyclic group of order 3 on the coordinates {1,2, n}: 

C = {i, {1,2, n),{l,n,2)} 

given in cycle notation. ■ 
On a side note. Theorem 1211 was also shown in [20] using 
ad-hoc arguments. Whether other infinite families can be 
shown to be optimal using these anticodes is still unresolved. 



C. Asymptotic Bounds 

Some of the constructions and bounds presented in previous 
sections take on a simple asymptotic form, which we explore 
below. We will compare the resulting asymptotic bounds with 
those implied by the previous constructions of [24]. 

Deflnition 22. Given an {n, M, d) -LMRM code, we say it has 



rate R = 



and normalized distance S 



A slight peculiarity arises here: One might expect the rate 
of a code to be defined as and not = 

since the ambient space S„ is of size n[. However, doing so 
results in asymptotic bounds equal to 0. 



We begin with the asymptotic form of Theorem |20l and 
remind that the binary entropy function H2 : [0, 1] ^ [0, 1] is 
defined as 



-plog2p-(l-p)log2(l-p). 



Theorem 23 . For any {n,M,d) -LMRM code. 



R^is[l 
■2-26 



5 ) log2 
-0(1). 



H2 U 



Proof: According to Theorem l20l 



M ^ 



d+r 



d+r 



r ! 



(d!)L3j-i(d-r)! 



d+r 
2 



d+r , 
2 



where r = n mod d. Moving to the R and S notation and 
slightly simplifying the expression we get 



((^5«)!)LiJ-i 



2 



2 



(n (^S-1+S[l\ 



At this point we use the well-known Stirling's approximation, 
ml = \/2TTm{m/e)"'{l + 0{l/m)). After rearranging we get 

2(2-241J)" 



2Rn ^ 



(m-41J)n 



We take log2 of both sides, divide by n, and do some 
rearranging to reach 



R ^ 2 - 2^5 

S + l-S 

6 

■2-25 



6 ] log2 
-0(1) 



- ^ j log2 S 

log2 (^5 + 1- S 
i -l\+H2[5 



0(1) 



as claimed. ■ 
For the next two asymptotic forms we need an estimate on 
the size of a ball in the i'co-norm. While for any fixed radius 
r, tight asymptotic bounds on |-B,.^„| are given in [27], we 
require an estimate for r = 0(n). The best estimate, to our 



knowledge, for ^ r ^ 

\^r,n\ ^ 



was given in [19]: 
V'27Tn /2r + l\" 



22r 



e 

B-2r 



(2) 



\Br,n\ ^ ((2r + l)!)^ n (3) 

i=r+l 
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For our purposes, however, we do require an upper bound on 
|Br,«| for the entire range ^ r ^ « — 1. Therefore, we 
present an augmentation of Q in the following lemma. 

Lemma 24. For allO ^ r ^ n — 1, 

2 



l((2r + l)!)5rSnf:,+i(i!)^ 

'^r.n ^ S 2i-+2-n -, 2 

l(n!)^n:L",.Vi(^-!)^ - 

Proof: It is easily seen that Br „(/) is the set of all 
permutations corresponding to non-vanishing terms in per(A) 
where A is the binary banded Toeplitz matrix defined by 
Aij = 1 iff |f — / I ^ r. This observation has been used both 
in '[27] and in [19]. 

The upper bound is immediately derived by using 
Bregman's Theorem. For example, for ^^-^ ^ r ^ « — 1, 
the matrix A has 2r + 2 — « rows with n I's, and two rows 
with i I's for each r + l^i^n — 1. ■ 

We now state the asymptotic form of the Gilbert- 
Varshamov-like bound of Theorem |6l 

Theorem 25. For any constant < i5 ^ 1 there exists an 
infinite sequence of (n, M, d) -LMRM codes with | ^ <5 and 

rate R = satisfying R ^ /gv(^) + where 

'log2l+2^^(log2e-l)-l 
-2(51og2i+2(l-(5)log2e 

Proof: By Theorem |6] we are guaranteed the existence of 
an {n,M, d)-LMRM code of size M ^ n\/ \Ba-i^„\. We can 
now use Lemma l24l and replace |Bj_i „| with an appropriate 
upper bound. 

Suppose '-^^ ^d — l^n — 1 (the proof for the other case 
is similar). Then by Lemma l24l 



0€r ^ 



n-l 



n-1 



< r < n 



< ^ ^ i 
5 ^ ^ ^ 1 



IB, 



Sn—l,n I 



i=Sn 

„(2<J-l)n ^ („-l)! 

{5n-l)\ 

20{n) 



We now have 



2Rn ^ 



nl 



\Bd-l,n\ 

Taking log2 of both sides and dividing by n completes the 
proof. ■ 
The ball-packing bound of Theorem |7] has the following 
asymptotic equivalent: 

Theorem 26. For any {n,M,d) -LMRM code, 

R^3 + \og2] + o{l). 

Proof: The bound of Theorem |7] together with the lower 
bound of (|2]l becomes. 



M ^ 



[{d-l)/2\,n 



V2^ \d' + l 



where d' = d — 2 + {d mod 2). Changing to the R and S 
notation, using Stirling's approximation, and then taking logj 
and dividing by n gives as 

Ri^3 + \og2^+o{l), 

as desired. ■ 
Finally, we analyze the asymptotics of the codes produced 
by Construction [T] 

Theorem 27. For any constant < ^ ^ 1, Construction \l\ 
produces codes of rate 



R 





1 




"1" 






) log2 ( 






_s_ 




s 



+ s + s 



1 log2 



Proof: For any {n, M, d)-LMRM code produced by Con- 
struction [T] we know that 

M = ( In/d] !)" ^ ( [n/d\ '^^ . 

Just like before, we change to the S and R notation: 



"(i-4)J) 



«(^+4}j-i) 



We then take log2 of both sides, and divide by n to reach the 
claimed result. ■ 
All the asymptotic bounds are shown in Figure [T| Several 
interesting observations can be made. First, the ball-packing 
bound of Theorem |26] is weaker than the code-anticode bound 
of Theorem |23] This, however, may be due to a poor lower 
bound on the size of a ball from (|2]i. It was conjectured in [19] 
that this lower bound might be improved substantially. We also 
note that Construction [T] produces codes which asymptotically 
out-perform the Gilbert- Varshamov-like bound of Theorem |25] 
for a wide range of S (with crossover at S ^ 0.34904), and 
appear to be quite close to the bound otherwise. Again, this 
might be a result of a weak upper bound on the size of a ball. 
Finally, the codes presented by [24] are severely restricted 
since they are derived from binary codes in the «-cube, and 
as such, are bounded by the «-cube versions of the Gilbert- 
Varshamov bound and the MRRW bound (see, for example, 
[25]). 

V. Conclusion 

We have studied codes for the rank modulation scheme 
which protect against limited-magnitude errors. We presented 
several code constructions which, in some cases, produce 
optimal codes. The codes constructed can also be encoded and 
decoded recursively, while the code of Construction [T] may be 
encoded/decoded directly using a simple procedure with small 
loss in rate. We note that all the constructions we presented 
create codes which are subgroups of S„. 

We also explored bounds on the parameters of these codes. 
The strongest upper bound appears to be the code-anticode 
bound of Theorem[T6] In the asymptotic study of these bounds, 
the simple code from Construction [T] shows a better rate than 
the one guaranteed by the Gilbert- Varshamov-like bound of 
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Figure 1. (a) The Gilbert- Vai'shamov bound in the «-cube (b) The MRRW bound in the n-cube (c) The rate of the code from Construction [T] (d) The 
Gilbert- Varshamov-like bound of Theorem 1251 (e) The code-anticode bound of Theorem 1231 (f) The ball-packing bound of Theorem 1261 



Theorem |25] and the ball-packing upper bound of Theorem 
|26] is always weaker than that of the code-anticode bound of 
Theorem |23] Both, however, may be a result of a loose bound 
on the size of a ball in the i'co-metric. 
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